Improving Encarta Search Engine Performance by Mining User Logs
نویسندگان
چکیده
We propose a data-mining approach that produces generalized query patterns (with generalized keywords) from the raw user logs of the Microsoft Encarta search engine (http://encarta.msn.com). Those query patterns can act as cache of the search engine, improving its performance. The cache of the generalized query patterns is more advantageous than the cache of the most frequent user queries since our patterns are generalized, covering more queries and future queries even those not previously asked. Our method is unique since query patterns discovered reflect the actual dynamic usage and user feedbacks of the search engine, rather than the syntactic linkage structure of web pages (as Google does). Simulation shows that such generalized query patterns improve search engine’s overall speed considerably. The generalized query patterns, when viewed with a graphical user interface, are also helpful to web editors, who can easily discover topics in which users are mostly interested.
منابع مشابه
Mining Generalized Query Patterns from Web Logs
User logs of a popular search engine keep track of user activities including user queries, user click-through from the returned list, and user browsing behaviors. Knowledge about user queries discovered from user logs can improve the performance of the search engine. We propose a data-mining approach that produces generalized query patterns or templates from the raw user logs of a popular comme...
متن کاملMining Web Logs for Actionable Knowledge
Everyday, popular Web sites attract millions of visitors. These visitors leave behind vast amount of Web site traversal information in the form of Web server and query logs. By analyzing these logs, it is possible to discover various kinds of knowledge, which can be applied to improve the performance of Web services. A particularly useful kind of knowledge is knowledge that can be immediately a...
متن کاملI Data Mining Techniques and Analysis of Concept Based User Profiles from Search Engine Logs
Search engine logs are emerging new type of data user profiling component of any personalization interesting opportunities for data mining. Early user profiling work on mining data mostly attempted to discover knowledge at the level of queries based on objects that users are interested in positive preferences but not the objects in negative preferences. In our paper we focus on search engine lo...
متن کاملWeb Log Mining in Search Engines
Search engine logs not only keep navigation information, but also the queries made by their users. In particular, queries to a search engine follow a power-law distribution, which is far from uniform. Clicks and queries can be used to improve the search engine itself in different aspects: user interface, index performance, and answer ranking. In this chapter we present the main ideas and we sho...
متن کاملWeb Usage Mining in Search Engines
Search engine logs not only keep navigation information, but also the queries made by their users. In particular, queries to a search engine follow a power-law distribution, which is far from uniform. Queries and related clicks can be used to improve the search engine itself in different aspects: user interface, index performance, and answer ranking. In this chapter we present some of the main ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IJPRAI
دوره 16 شماره
صفحات -
تاریخ انتشار 2002